PRE-PROCESSING OF BIT RATE ALLOCATION IN A MULTI- 
CHANNEL VIDEO ENCODER 



BACKGROUND OF THE INVENTION 

The present invention relates to a method and 
apparatus for bit rate allocation in a multi-channel 
video data encoder. The invention relates generally 
to statistical multiplexing, wherein a bit rate 
(e.g., bandwidth) is allocated to the different 
channels based on the channels' bit rate needs and 
the overall available bandwidth. 

Statistical multiplexing is the process of 
encoding a number of signals at variable bit rates 
and combining the variable-rate bitstreams into a 
single fixed-rate transport stream so that the 
bandwidth allotted to each signal is flexible and 
varies with each signal's bit rate need. 
Conventionally, an estimate of bit rate need is made 
based on signal statistics. After a bit rate is 
allocated based on the need, the data in each signal 
is compressed and encoded using a specific 
quantization level. The amount of data that results 
from the compression is examined in each channel, 
and the quantization level is adjusted so that 
channels with more encoded data receive a higher bit 
rate. Next, the video data is compressed and 
encoded again using the adjusted quantization level. 
The process may be repeatedly successively in 
multiple feedback cycles. Other conventional 
techniques attempt to equalize a quantization 
distortion measure across the channels. 



However, the conventional techniques have 
various drawbacks . For example, the use of 
successive feedback cycles in the compressor can be 
time-consuming and computationally intensive. 
5 Additionally, special bit rates needs for specific 

types of video scenes may not be considered. 
Moreover, the equalization of a quantization 
distortion measure does not reliably translate to an 
equalization of perceived image quality. 

10 Accordingly, it would be desirable to provide a 

high-performance dynamic rate allocation system that 
quickly and accurately allocates bit rate to a 
plurality of video channels to equalize the overall 
image quality of all channels at any time instant. 

15 The system should provide a pre-processor which 

measures statistical information of the video data 
prior to compression and encoding to estimate the 
relative bit rate required to adequately encode each 
video scene. The measurements should be made 

20 sufficiently early in the encoding process to 

eliminate undesirable time delays. The system 
should provide the allocated bit rate to the video 
compressor from the pre-processor in a feedforward 
path to avoid undesirable feedback. The system 

25 should also provide the capability for feedback 

processing to fine tune the allocated bit rate 
before providing it to the compressor. 

Furthermore, the system should measure or 
detect at least some of the following 

30 characteristics of each video frame (e.g., picture): 

spatial activity, temporal activity, image size, 



frame rate, scene change, brightness, flash, fade, 
and horizontal pixel resolution. The system should 
bias the bit rate allocation according to inter- 
frame distance, whether the average spatial activity 
5 level is below a lower threshold, whether the inter- 

frame distance is above an upper threshold or below 
a lower threshold, whether the quantization of 
previous frames is above an upper threshold, the 
length of the Group of Pictures (GOP) , and a user- 

10 selectable priority factor. 

The system should also allocate any surplus bit 
rate, if any, among the channels, to avoid having 
unused bandwidth. 

The system should be compatible with 

15 progressive or interlaced video, as well as 

different image shapes and sizes, including Video 
Object Planes (VOPs) . 

The system should further be compatible with 
different video standards including NTSC, PAL, and 

20 NTSC detelecine. 

The system should provide a pre-processor which 
can be used with existing commercially available 
compression circuitry to allow quick and inexpensive 
retrofitting of such circuitry. 

25 The present invention provides a system having 

the above and other advantages. 



SUMMARY OF THE INVENTION 



The present invention relates to a method and 
apparatus for bit rate allocation in a multi-channel 
video data encoder, 

A method for allocating a bit rate to a 
plurality of variable rate video channels in a video 
encoder includes the steps of: processing video 
data from a current picture (e.g., frame) in each 
respective channel to determine at least a spatial . 
activity and a temporal activity thereof; and 
determining a bit rate demand D x for each current 
picture according to the associated spatial activity 
and temporal activity. 

The method is suitable for use with multiplexed 
channels that are all variable rate, as well as with 
a combination of fixed rate and variable rate 
channels . 

The method may include the further step of 
adjusting the bit rate demand D ± for each current 
picture according to whether at least one of a scene 
change, fade and flash is detected for the current 
picture. Generally, bit allocation is increased if 
any of these events are detected since such events 
cannot usually be efficiently coded. 

The method may include the further steps of 
increasing the associated temporal activity of each 
current picture if the associated spatial activity 
is below a lower threshold; and adjusting the bit 
rate demand Di for each current picture according to 
the increasing step. This is done since motion 
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within a scene with a low spatial activity will 
produce an artificially small inter-frame 
difference . 

The method may include the further steps of 
5 increasing the bit rate demand D 1 for each current 

picture when the associated temporal activity 
exceeds an upper threshold; and/or decreasing the 
bit rate demand D x for each current picture when the 
associated temporal activity is less than a lower 
10 threshold. This is done since high motion scenes 
require additional bits to maintain a given image 
quality while fewer bits are required for low motion 
scenes . 

The method may include the further steps of 
15 determining a quantization level of at least one 

previous picture for each current picture; and 
increasing the bit rate demand Di for each current 
picture when the quantization level of the at least 
one previous picture exceeds an upper threshold. 
20 This is done to avoid oscillations in the 

quantization level that may be noticeable to a 
viewer . 

Each current picture may be part of an 
'associated Group Of Pictures (GOP) , where each GOP 

25 typically includes one or more intra-coded pictures 

and several inter-coded pictures. In this case, the 
method may include the further steps of decreasing 
the bit rate demand Di for each current picture when 
a length of the associated group of pictures exceeds 

3 0 a nominal level; and/or increasing the bit rate 

demand B x for each current picture when a length of 
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the associated group of pictures is less than a 
nominal level. This is done since fewer bits are 
required to code a large GOP since there are 
relatively more inter-coded (e.g., predictive coded 
5 pictures), such as B- and P-pictures. 

The method may include the further step of 
reducing or eliminating the increase or decrease of 
the increasing and decreasing steps, respectively, 
when the temporal activity of each current picture 

10 exceeds an upper threshold . This is done since high 

motion is likely to result in relatively more intra- 
coded pictures in a GOP. 

The method may include the further step of 
adjusting the bit rate demand Di for each current 

15 picture according to a horizontal pixel resolution 
thereof. This is done since more bits are required 
to code a higher resolution picture. 

The method may include the further steps of 
determining a brightness level for each current 

20 picture; and increasing the bit rate demand D x for- 
each current picture when the associated brightness 
level is less than a lower threshold. Darker scenes 
should be coded with additional bits to maintain a 
perceived image quality. 

25 The method may include the further step of 

adjusting the bit rate demand D x for each current 
picture according to priority factor thereof that 
indicates a relative importance of each current 
picture in the multiplexed data stream. Thus, more 

30 important channels, such as movies and pay per view 



7 



events, for example, may be allocated additional 
bits to provide an enhanced picture. 

Moreover, the allocated bit rate for each 
current picture used in the encoding step may be 
5 determined in a single iteration or in successive 

iterations . 

The method may comprise the further steps of 
determining an overall available bit rate for 
transmitting the current pictures in a multiplexed 

10 data stream; determining an allocated bit rate for 

each current picture according to a ratio of bit 
rate demand of each current picture and a sum of the 
bit rate demands from each current picture; and 
providing the allocated bit rate for each current 

15 picture to respective video data compressors for 

compressing the respective current pictures to 
obtain compressed video data for transmission in 
said multiplexed data stream. 

When the allocated bit rate for each current 

20 picture is determined in a plurality of iterations 

including an initial iteration and at least one 
successive iteration, the method may include the 
further steps of determining a bit rate surplus or 
deficit between the overall available bit rate and 

25 a sum of the allocated bit rates for each current 

picture in the initial iteration; and allocating the 
surplus or deficit among at least some of the current 
pictures according to a ratio of bit rate demand of 
the at least some of the current pictures and a sum 

30 of the bit rate demands of the at least some of the 

current pictures in the at least one successive 
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iteration. A satisfactory final bit rate can be 

converged on usually in about three total iterations. 
A corresponding apparatus is also presented. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 illustrates a multi- channel video 
encoder in accordance with the present invention. 

FIG. 2 illustrates conceptually how bit rate 
5 demand is determined for a video channel in 

accordance with the present invention. 

FIG. 3A is Part A of a flowchart illustrating 
how bit rate demand D^ is determined in accordance 
with the present invention. 
10 FIG. 3B is Part B of the flowchart of FIG. 3A 

illustrating how bit rate demand Di is determined in 
accordance with the present invention. 

FIGs 4, 4A and 4B illustrate how macroblock 
activity is calculated in accordance with the 
15 present invention. 

FIG. 5 is a flow chart illustrating how 
allocated bit rate R x is determined for several 
channels of a multi-channel video encoder. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to a method and 
apparatus for bit rate allocation in a multi-channel 
video data encoder. 
5 FIG. 1 illustrates a multi-channel video 

encoder in accordance with the present invention. A 
multi-channel video encoder shown generally at 100 
includes a number N of channel encoders 111, 121, . 
. . , and 131. Each channel encoder, also referred 

10 to as an application, includes a pre-processor and a 

compressor. These components need not be physically 
separate but may be implemented in shared hardware, 
firmware and/or software. However, in a 
particularly advantageous embodiment of the present 

15 invention, the pre-processor circuitry is used with 

existing commercially available video compression 
circuitry to allow quick and easy retrofit of an 
existing channel encoder. 

A first channel encoder 111 includes a pre- 

20 processor 110 and a compressor 140, while a second 

channel encoder 121 includes a pre-processor 120 and 
a compressor 150, and an Nth channel encoder 131 
includes a pre-processor 130 and a compressor 160. 
Each compressor may be implemented using a 

25 commercially available video processor, such as 

those available from C-Cube Microsystems, Milpitas, 
California, USA. 

Each compressor performs conventional 
compression and encoding steps such as motion 

30 compensation and estimation, transform coding (e.g., 
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using the Discrete Cosine Transform) , and Huffman 
encoding. Pixel data from a first, second and Nth 
channel are provided to pre-processors 110, 120 and 
130, respectively. Each pre-processor may operate 
5 in parallel, and performs calculations using the 

respective pixel data to determine a bit rate demand 
Di for each ith channel. A corresponding bit rate 
demand signal is provided from each pre-processor to 
a central control 170. For example, pre-processor 

10 110 may communicate with the central control 17 0 via 

line 114, while pre-processor 120 communicates with 
the central control 170 via line 124, and pre- 
processor 130 communicates with the central control 
170 via line 134. The central control 170 may be 

15 implemented as a micro-processor using known 

circuitry. Moreover, each pre-processor 110, 120, 
130 may communicate with the central control 170 via 
a bus in a time-sharing manner. Each pre-processor 
may also communicate with the respective compressor. 

20 The central control 170 processes the bit rate 

demand signals D ± to determine an allocated bit rate 
Ri for each ith channel as explained in greater 
detail in FIG. 5 below. The respective allocated 
bit rate signals R L/ where i=l, 2, . . .N, are provided 

25 to the compressor 140, compressor 150, and 

compressor 160 via lines 144, 154, and 164, 
respectively. Each compressor also receives the 
pixel data from its respective channel. For 
example, compressor 140 receives the channel "1" 

30 pixel data via line 112, while compressor 150 

receives the channel "2" pixel data via line 122, 
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and compressor 160 receives the channel N pixel data 
via line 132. 

Each compressor compresses and encodes the 
respective pixel data according to the allocated bit 
5 rate R x , which is provided as a feedforward signal 

from the pre-processor to the respective compressor. 
Once the control 170 receives the Di values from the 
pre-processors, the control calculates a final R x in 
one or more iterations, and provides the final R x 

10 value to the compressors. 

The pixel data which is compressed and encoded 
at the respective compressors at the allocated bit 
rate is then provided to a multiplexer (MUX 180) to 
provide a time multiplexed data stream (e.g., 

15 transport stream) on line 182. Specifically, the 

MUX 180 receives compressed pixel data from 
compressor 140 via line 142, from compressor 150 via 
line 152, and from compressor 160 via line 162. The 
MUX 180 also receives a control signal from the 

20 central control 170 via line 172 for synchronizing 

the multiplexing of the video data from the 
respective channels. The MUX 180 and control 170 
need not be physically separate but may be 
implemented in shared hardware, firmware and/or 

25 software. 

FIG. 2 illustrates conceptually how bit rate 
demand is determined for a video channel in 
accordance with the present invention. The bit rate 
demand Di from the ith channel is an indication of 

30 the complexity of the video source being compressed. 

Note that the terms "bit demand" and "bit rate 
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demand" may be used interchangeably, as the bit rate 
demand is simply the bit demand in a specific time 
interval. At block 210, the spatial activity of a 
current video frame is calculated. The spatial 
5 activity, or intra-frame activity, is an average 

macroblock activity for the frame as discussed in 
greater detail below in FIGs 4, 4A and 4B. 

Note that the term "frame" or "picture" as used 
herein refers generally to any image, including a 

10 particular field of interlaced video, a progressive 
mode frame, or even a Video Object Place (VOP) , 
which is an image in a sequence of generally 
arbitrarily-shaped images. VOPs are discussed in 
the MPEG-4 standard. 

15 At block 220, a temporal or inter-frame 

activity of the current video frame is calculated. 
At block 230, the scene change, fade, and flash 
detection is made. At block 240, the average 
quantization step size for the current video frame 

20 is determined. The scene change, fade, and flash 

detect block 230 receives data inputs from the 
temporal activity block 220 and quantization step 
size block 240. At block 250, the brightness of the 
current frame is determined, while at block 2 60, the 

25 frame rate is determined, and at block 270, the 

current image size is determined. Information from 
each of the blocks 210, 220, 230, 240, 250, 260 and 
270 is then provided to the bit rate demand block 
280 to determine D x . 

30 At block 220, the temporal activity of the 

current frame is determined as the sum of the 
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absolute pixel differences between the current and 
previous frames* Each pixel difference is 
determined for correspondingly situated pixels 
within the respective frames. Referring to block 
240, the quantization step size is the result of 
compressing a prior picture at a previously- 
determined bit rate. At block 230, the scene 
change, fade, and flash detect may be performed 
using any known technique. For example, a fade may 
be detected if the average picture brightness is 
decreasing over a number of frames. Moreover, a 
flash may be detected when the picture brightness 
increases rapidly above an upper threshold. 

At block 250, the brightness is determined as 
the average pixel luminance in the current frame. 
At block 2 60, the frame rate is determined according 
to the interval between the current and previous 
frames. The frame rate may vary, for example, 
between standard NTSC video, with thirty frames per 
second, PAL video at twenty-five frames per second, 
and NTSC detelecine video at an average of twenty- 
four frames per second. 

At block 255, a length of the group of pictures 
(GOP) is also provided as an input for determining 
the bit rate demand At block 270, the image 

size of the current frame is determined based on the 
horizontal resolution, or number of pixels per line. 

FIG. 3A is Part A of a flowchart illustrating 
how bit rate demand Di is determined in accordance 
with the present invention. FIGs 3A and 3B 
illustrate an exemplary process that is used by the 
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pre-processors in determining a bit rate demand D x 
for the respective video channel. It will be 
appreciated that the different steps may be 
performed in different sequences, that not every 
5 step is required, and that additional steps which 

are not shown may be performed. At block 302, the 
average spatial activity SA avg is calculated for the 
current frame. The calculation of SA avg is discussed 
further in connection with FIGs 4, 4A and 4B, below. 

10 In block 304, the average spatial activity is 

optionally mapped to a more convenient range. At 
block 306, SAavg is clipped to remove especially high 
or low values. The actual values used for clipping 
can be determined* by experimentation. At block 308, 

15 the initial bit rate demand for the ith channel, Di, 

is set to the average spatial activity value. 

At block 310, the inter-frame distance INTER dist 
is calculated as the sum of the absolute differences 
of the luminance pixel values between the current 

20 and previous frames. INTER dist is a measure of 

temporal activity for the current frame. At block 
312, a scene change, fade, and flash detection is 
made. At block 314, the bit rate demand D x is 
adjusted based on whether a fade is detected. 

25 Specifically, if a fade is detected, Di may be 

increased by a predetermined amount that can be set 
by experimentation. Di is increased since coding of 
fades is generally inefficient. That is, additional 
bits are required to maintain a given video quality 

30 level. Similarly, at block 316, D x is adjusted 

based on whether a flash is detected for the current 
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frame. If a flash is detected, D x is increased by a 
predetermined amount that can be determined by 
experimentation . 

At block 318, if SAa Vg is less than a lower 
5 threshold, INTER dist is increased. This is done 
since motion within a scene with a low spatial 
activity will produce an artificially small inter- 
frame distance. At block 320, if INTER dl5t is above 
an upper threshold which can be determined by 

10 experimentation," D± is increased. This is done 

since the inventors have determined that very high 
levels of inter-frame activity are relatively more 
noticeable and therefore require additional bits to 
maintain a given image quality level. Conversely, 

15 as block 322, if INTER dlst is below a lower threshold 
which can be determined by experimentation, is 
decreased since scenes with little motion are 
encoded more efficiently, so fewer bits can be used 
for coding. 

20 At block 324, if the previous quantization 

level, QUANTp r evious, is greater than an upper 
threshold, which can be determined by 
experimentation, D x is increased. This is done to 
avoid oscillations in the quantization level that 

25 may be noticeable to the viewer. Additionally, note 
that since the quantization level may vary with the 
picture type, the quantization level should be taken 
only from similar picture types, e.g., from B- 
pictures or P-pictures. Moreover, the quantization 

3 0 level can be run through a fast attack, slow decay 

function to avoid fluctuations in the bit rate. 
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Such a function can be implemented ina filter that 
allows a rapid increase in the quantization level 
but only a slow decrease. Thus, QUANT previ ou S may 
represent a value that is a function of the actual 
5 quantization level of one or more previous frames. 

Next, at block 32 6, the length of the GOP is 
used to modify the bit rate demand Di . Note that 
according to the MPEG-2 or similar video standards, 
a group of pictures includes at least one I-picture, 
10 which is intra coded without reference to any other 

picture, followed by a number of inter-coded or 
predicted pictures such as B-pictures and P- 
pictures . 

A B-picture is a bi-directionally predicted 

15 picture which uses both previous and future pictures 

(in presentation order) for prediction. A P-picture 
uses only a previous picture for prediction. 
Therefore, with successive GOPs, each having one I- 
picture, a frequency of I pictures in the data 

20 stream can be determined. Therefore, at block 32 6, 

if the nominal GOP is more than fifteen pictures in 
length, Di should be decreased. This accounts for 
the relationship between the required bit rate and 
the frequency of I-pictures. 

25 That is, the frequency of I-pictures is 

generally inversely proportional to the length of 
the GOPs. Moreover, for a given picture quality, 
fewer bits are needed to code pictures in a larger 
GOP since there are relatively more inter-coded 

30 pictures, and inter-coded pictures (B-pictures and 

P-pictures) require fewer bits to code than I- 
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pictures. Conversely, more bits are needed to code 
pictures in a smaller GOP since there are relatively 
fewer inter-coded pictures* 

The nominal GOP length for adjusting the bit 
5 rate need not be fifteen pictures, but this value 

has been determined by the inventors to be 
5 satisfactory. It is possible to use higher and/or 

S lower GOP length thresholds. 

d FIG. 3B is Part B of the flowchart of FIG. 3A 

2 10 illustrating how bit rate demand Dj. is determined in 

3 accordance with the present invention. At block 

^ 328, if the length of the GOP is less than fifteen 

3- pictures, an increase is set for Di. At block 330, 

p ; the magnitude of the increase or decrease of 

Z 15 based on the length of the GOP is reduced if the 

J inter-frame distance INTER dist is greater than an 

4: " upper threshold which can be determined by 

experimentation. That is, the magnitude of the 
adjustment is reduced when the inter-frame activity 
20 is high since a large amount of motion is likely to 

cause intra-coding in all pictures, thereby 
countering the effect of the I- frame frequency. The 
decrease of block 326 and the increase of block 328 
may even be eliminated for very high levels of 
25 inter-frame activity. At block 332, D x is modified 

by the adjusted increase or decrease, if any. 

At block 334, Di is adjusted based on the 
horizontal resolution of the current frame. 
Specifically, D L is increased if the horizontal 
30 resolution is greater than a baseline level, and D 1 

is decreased if the resolution is less than the 
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baseline level. For example, if the baseline level 
corresponds to standard NTSC video, additional bits 
will be allocated for High Definition Television 
(HDTV) . At block 336, if the brightness is less 
5 than a threshold that can be determined by 

experimentation, Di is increased. The bit rate is 
increased for relatively dark scenes, since such 
scenes generally are not coded efficiently. At 
block 338, if a scene change is detected, is set 

10 to a maximum value. This is done since scene 

changes generally are also not coded efficiently, 
even more so than dark scenes. 

At block 340, D x is clipped between maximum and 
minimum values and linearly scaled to a 

15 predetermined range, e.g. 50 to 700. 

At block 342, the allocated bit rate Di is 
optionally further adjusted by a priority factor 
according to the relative importance of the video 
channel. Assuming a base line priority factor of 

20 "1", for example, a relatively more important video 

channel will have a priority factor greater than 
"1", while a relatively less important video channel 
will have a priority factor less than "1". For 
example, it may be desirable to assign a relatively 

25 high priority factor to movies and special sporting 

events, while a relatively low priority factor is 
assigned to common news broadcasts. The priority 
factor may be specified by an operator using a 
keyboard or other input device that communicates 

30 with the central control 170 of FIG. 1. Also at 
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block 342, Di is normalized, for example, by 
dividing by 128 . 

At block 344, the final bit rate demand D ± is 
now available for use by the central control 170 in 
5 allocating the bit rate R x for each ith channel as 

set forth in FIG. 5 and the accompanying discussion 
below. 

FIGs 4, 4A and 4B illustrate how macroblock 
activity is calculated in accordance with the 

10 present invention. As each picture in encoded, an 

activity value is generated for each macroblock in 
the current picture. Each 16 x 16 luma value 
macroblock is subdivided into 2x2 tiles, and the 
values in each tile are examined. In each tile, the 

15 absolute difference in luma values between one 

corner (either the top right or bottom left) and its 
horizontal and vertical neighbors is taken. For 
example, in FIG. 4 a macroblock shown generally at 
400 includes tiles 410 and 420. Tile 410 includes 

20 pixel "a ,T 412, pixel "b" 414, pixel "c" 416 and 

pixel "d" 418. The tile activity is the absolute 
value of b-a plus the absolute value of b-d. Tile 
420 includes a pixel "a" 422, pixel "b" 424, pixel 
"c" 426, and pixel "d" 428. The tile activity is 

25 the absolute value of c-a plus the absolute value of 

c-d. 

The absolute difference terms for each tile are 
summed to give the macroblock activity. The 
macroblock activity values are then summed across 
3 0 the entire current picture and divided by the total 

number of macroblocks in the picture to obtain the 
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picture's average spatial activity. The average 
spatial activity may then be reported to the pre- 
processor. 

Of course, the activity calculation procedure 
5 shown may be modified in various ways. For example, 

the tile size may be varied along with the selected 
reference pixel in each tile. 

FIG. 5 is a flow chart illustrating how 
allocated bit rate R x is determined for several 

10 channels of a multi-channel video encoder. Once the 

bit rate demand Di has been determined by the pre- 
processors for each video channel as explained in 
connection with FIGs 3A and 3B, it is forwarded to 
the central control 170, as shown in FIG. 1. The 

15 central control 170 uses the demand signals D x to 

determine corresponding allocated bit rate signals 
Rx for each ith channel. Generally, one or more 
iterations may be used by the control to arrive at a 
final R x . That is, an initial R L may be fine tuned 

20 in successive iterations. 

Referring to FIG. 5, at box 500, the central 
control solicits the applications to obtain the bit 
rate demands d, e.g., every ten msec. The term 
"application" is used generally to refer to each 

25 channel encoder, which comprises the combination of 

a pre-processor and compressor for each channel. At 
box 505, if an application does not respond to the 
control's request within a predetermined amount of 
time, for example, eighty msec, the control marks 

30 that application as "dead" or "inactive" and sets D i 

to a minimum value. An application may or may not 
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respond to a solicitation based on whether it has 
computed new need information. That is, the 
application need not respond to a solicitation if it 
has not computed a new need value, Dj_. The control 
5 retains the most recent need value from each 

application. The application remains in a minimum 
rate status until it responds to a subsequent 
control solicitation in which case the application 
is marked as alive and is allocated the currently 

10 determined rate Ri. 

Each time the updated bit rate needs D x have 
been received from at least one-half of the alive 
VBR applications, for example, the control executes 
the bit allocation procedure. Upon completion of 

15 the procedure, R L is sent to each application even 

if the application's new rate is the same as the old 
rate. The control and MUX implement the transport 
stream schedule according to the specified fixed 
rates, if any, and the calculated variable rates. 

20 At box 510, the sum of the bit rate requests 

from the dead applications is subtracted from an 
available bit rate. The available bit rate may be a 
fixed amount that is known a priori or may vary with 
time. The available bit rate is the overall amount 

25 of channel bandwidth that is currently allocated for 
transmitting the data from each VBR video channel. 
This available bit rate may correspond to a 
transport bit rate minus the fixed rate allocations 
for any fixed rate channels that are multiplexed 

30 with the VBR channels. At block 515, the remaining 

bit rate is allocated to the alive or active 
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applications in proportion to their need values. At 
block 520, each allocated rate is examined to see if 
it exceeds the application's specified minimum or 
maximum rate. If so, the allocated rate is clipped 
5 to the appropriate minimum of maximum limit. 

At block 525, any surplus and deficit bit rate 
from each channel is collected (e.g., summed) to 
form a net surplus or net deficit bit rate. A net 
surplus occurs when the available bit rate is 

10 greater than the sum of the allocated bit rates, 

while a net deficit occurs when the opposite is 
true. Note that a surplus for one channel can 
offset a deficit for another channel. At block 530, 
if a net surplus is present, any application at its 

15 maximum rate is eliminated from further adjustments 
until the next iteration of the process, if any. 
Similarly, at block 545, if a net bit rate deficit 
is present, any application at its minimum rate is 
eliminated from further adjustments until the next 

20 iteration of the process, if any. 

At block 535, the net surplus bit rate is 
reallocated among the remaining applications in 
proportion to their need values. Similarly, at 
block 550, the net deficit is reallocated among the 

25 remaining applications in proportion to their need 

values. At block 555, if there is no net surplus or 
net deficit present, the allocated bit rate R x is 
communicated from the central control 170 to each 
compressor. The video data is compressed and 

30 encoded according to the final value of R x and 
communicated to the MUX 180. 
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At block 540, the reallocation of the net 
surplus or net deficit may be repeated in successive 
reiterations if a net surplus or net deficit is 
still present and the bit rate has not been fixed at 
5 a minimum or maximum value for all applications. 

The process may be repeated at block 525 for an 
additional two iterations, for example. Limiting 
the process to a maximum of three iterations is done 
to limit computation time. However, it should be 

10 appreciated that the present invention may also be 

implemented with one or more iterations. 

If a net surplus or net deficit exists after 
the third iteration, it is distributed equally among 
all applications that remain available for 

15 adjustment; that is, the applications that are not 

dead or fixed. At block 560, the final allocated 
bit rate R x is communicated to the compressors for 
each ith channel. 

An operator may control the bandwidth allocated 

20 to each application via bit-rate setting. Any 

application can be set to a fixed bit rate. A video 
encoder can alternatively be set to a variable bit 
rate (VBR) through the specification of a minimum 
and maximum bit rate. The minimum transport bit 

25 rate required for a multi-channel video encoder is 

equal to the sum of the fixed bit rates plus the sum 
of the minimum VBR rates, that is: 

MinimumNecessaryBitrate= iFixedRate + ZVBRMinRates . 
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Similarly, the maximum usable bandwidth is the sum 
of the fixed bit rates plus the sum of the maximum 
VBR rates: 

MaximumUseableBitrate= ZFixedRate + IVBRMaxRates . 

5 If the available transport bit rate is less than the 

minimum required bit rate, the system cannot comply 
with the specified configuration. If the available 
transport bit rate is greater than the maximum 
useable bit rate, then all VBR applications run at 

10 their respective maximum rates, and no statistical 

multiplexing will occur. If the available transport 
rate is between the minimum and maximum bit rates, 
statistical multiplexing occurs, and the entire 
available transport bandwidth is allocated among the 

15 different channels. 

Prior to encoding each picture, each VBR 
channel encoder checks for a new allocated bit rate, 
Ri, from the control. If there is a new rate, the 
channel encoder is commanded to compress and encode 

20 the channel data at the new rate. 

Regarding timing of the rate control scheme, as 
discussed, a frame activity value is calculated in 
the respective channel encoders and made available 
to the control at the frame rate. For standard NTSC 

25 encoding, the update interval is every 33 msec, 

while for PAL it is every 40 msec, and for NTSC 
with detelecine, the time between updates is 33 
msec, and 50 msec. 
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Preferably, the control executes its bit 
allocation process each time it has updates from at 
least one-half of the VBR applications. This should 
occur no less frequently than every fifty msec. 
5 As can be seen, the present invention provides 

a system for allocating a bit rate to a plurality of 
variable rate channels in a video encoder according 
to the bit rate demand from each channel and the 
available bit rate. Advantageously, a bit rate 
10 demand D x is determined in a pre-processor for each 

channel prior to compressing and encoding the data 
for transmission. The allocated bit rate R x may be 
determined from the D x values in one or more 
iterations . 

15 The bit rate demand Di accounts for various 

characteristics of the current picture data in each 
channel, including spatial activity, temporal 
activity, image size, frame rate, scene change, 
brightness, flash, fade, and horizontal pixel 

20 resolution. The system also biases the bit rate 

allocation according to inter-frame distance, 
whether the average spatial activity level is below 
a lower threshold, whether the inter-frame distance 
is above an upper threshold or below a lower 

25 threshold, whether the quantization of previous 

frames is above an upper threshold, the length of 
the Group of Pictures (GOP) , and a user-selectable 
priority factor. The system also allocates any 
surplus bit rate among the channels to avoid having 

30 unused bandwidth. 
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Although the invention has been described in 
connection with various specific embodiments, those 
skilled in the art will appreciate that numerous 
adaptations and modifications may be made thereto 
5 without departing from the spirit and scope of the 
invention as set forth in the claims. 



